Compression of double array structures for fixed length keywords

Authors

  • Masao Fuketa
  • Hiroya Kitagawa
  • Takuki Ogawa
  • Kazuhiro Morita
  • Jun-ichi Aoe
Abstract

A trie is one of the data structures used for keyword matching, with applications in natural language processing, IP address routing, and so on. It can be represented by the matrix form, the link form, the double array, and LOUDS. The double array combines the retrieval speed of the matrix form with the compactness of the list form. LOUDS is a succinct data structure based on bit strings. Retrieval with LOUDS is not faster than with the double array, but its dictionary size is smaller. This paper proposes a compressed double-array data structure obtained by dividing the trie at each depth and removing the BASE array from the double array. In the experimental results, retrieval speed was almost the same as that of the double array, and the presented method was the most compact of the compared methods, including LOUDS, for a large set of fixed-length keywords.

Keywords: trie; double array; fixed length; compression
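
For readers unfamiliar with the double array mentioned in the abstract, the following is a minimal sketch of the conventional BASE/CHECK representation for equal-length keywords. It is not the compressed, BASE-free structure the paper proposes, whose details are not given on this page. The function names `build_double_array` and `contains`, the use of `ord()` as the character code, and the first-fit placement strategy are all assumptions made for illustration.

```python
def build_double_array(keys):
    """Build BASE/CHECK arrays for a set of equal-length keywords (illustrative)."""
    # Step 1: build a plain nested-dict trie; each node maps a character code to a child.
    root = {}
    for key in keys:
        node = root
        for ch in key:
            node = node.setdefault(ord(ch), {})

    # Step 2: place nodes into the arrays. Slot 0 is the root; -1 in CHECK marks a free slot.
    base, check = [0], [-1]
    queue = [(0, root)]
    while queue:
        s, node = queue.pop(0)
        if not node:
            continue  # leaf: fixed-length keys end here, no BASE value needed
        codes = sorted(node)
        # First-fit search: smallest b >= 1 such that every slot b + c is free.
        b = 1
        while any(b + c < len(check) and check[b + c] != -1 for c in codes):
            b += 1
        # Grow both arrays so the largest child slot exists.
        need = b + codes[-1] + 1
        if need > len(check):
            grow = need - len(check)
            base.extend([0] * grow)
            check.extend([-1] * grow)
        base[s] = b
        for c in codes:
            check[b + c] = s  # CHECK records the parent that owns this slot
            queue.append((b + c, node[c]))
    return base, check


def contains(base, check, query, length):
    """Return True if `query` is one of the stored keywords of the given length."""
    if len(query) != length:
        return False
    s = 0
    for ch in query:
        t = base[s] + ord(ch)  # one transition: t = BASE[s] + c
        if t >= len(check) or check[t] != s:
            return False       # slot is free or belongs to another parent
        s = t
    return True


base, check = build_double_array(["abc", "abd", "xyz"])
found = contains(base, check, "abd", 3)
missing = contains(base, check, "abx", 3)
```

Because all keywords share one length, every leaf sits at the same depth, so this sketch needs no end-of-key marker: a query matches exactly when all of its transitions succeed. Each transition costs one addition and one comparison, which is the source of the fast retrieval the abstract attributes to the double array.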

Similar resources

Probing the Probabilistic Effects of Imperfections on the Load Carrying Capacity of Flat Double-Layer Space Structures

Load carrying capacity of flat double-layer space structures majorly depends on the structures' imperfections. Imperfections in initial curvature, length, and residual stress of members are all innately random and can affect the load-bearing capacity of the members and consequently that of the structure. The double-layer space trusses are susceptible to progressive collapse due to sudden buckli...

Experimental Observations of Construction Methods for Double Array Structures Using Linear Functions

A trie is an ordered tree data structure to store keywords. It is used in natural language processing and so on. The trie is represented by the double array. The double array can retrieve fast at time complexity of O(1). The double array using linear functions (DALF) is proposed as a compression method of the double array. DALF reduces space usage of the double array to 60%. DALF is built by us...

Universal source coding for complementary delivery

This paper deals with a universal coding problem for a certain kind of multiterminal source coding system that we call the complementary delivery coding system. Both fixed-to-fixed length and fixed-to-variable length lossless coding schemes are considered. Explicit constructions of universal codes and bounds of the error probabilities are clarified via type-theoretical and graph-theoretical ana...

Lossless Microarray Image Compression by Hardware Array Compactor

Microarray technology is a new and powerful tool for concurrent monitoring of large number of genes expressions. Each microarray experiment produces hundreds of images. Each digital image requires a large storage space. Hence, real-time processing of these images and transmission of them necessitates efficient and custom-made lossless compression schemes. In this paper, we offer a new archi...

Stridor in a Newborn with Double Aortic Arch-A Case Report

Introduction: Double aortic arch (DAA) is a congenital anomaly of the aortic arch. It is the most common type of complete vascular ring. When it occurs, the connected segment of the aortic arch and its branches encircle the trachea and esophagus, leading to symptoms related to these two structures. Case Report: We present a case of a newborn baby who developed biphasic stridor immediately after...

Journal title:
  • Inf. Process. Manage.

Volume 50  Issue 

Pages  -

Publication date: 2014